East Azerbaijan Province
BLIP-FusePPO: A Vision-Language Deep Reinforcement Learning Framework for Lane Keeping in Autonomous Vehicles
Miangoleh, Seyed Ahmad Hosseini, Aghdasian, Amin Jalal, Abdollahi, Farzaneh
In this paper, we propose Bootstrapped Language-Image Pretraining-driven Fused State Representation in Proximal Policy Optimization (BLIP-FusePPO), a novel multimodal reinforcement learning (RL) framework for autonomous lane-keeping (LK), in which semantic embeddings generated by a vision-language model (VLM) are directly fused with geometric states, LiDAR observations, and Proportional-Integral-Derivative-based (PID) control feedback within the agent observation space. The proposed method lets the agent learn driving rules that are aware of their surroundings and easy to understand by combining high-level scene understanding from the VLM with low-level control and spatial signals. Our architecture brings together semantic, geometric, and control-aware representations to make policy learning more robust. A hybrid reward function that includes semantic alignment, LK accuracy, obstacle avoidance, and speed regulation helps learning to be more efficient and generalizable. Our method is different from the approaches that only use semantic models to shape rewards. Instead, it directly embeds semantic features into the state representation. This cuts down on expensive runtime inference and makes sure that semantic guidance is always available. The simulation results show that the proposed model is better at LK stability and adaptability than the best vision-based and multimodal RL baselines in a wide range of difficult driving situations. We make our code publicly available.
- Asia > Middle East > Iran > Tehran Province > Tehran (0.05)
- Asia > Middle East > Iran > East Azerbaijan Province > Tabriz (0.04)
- North America > Canada > Quebec > Montreal (0.04)
- (2 more...)
- Information Technology > Artificial Intelligence > Robots > Autonomous Vehicles (1.00)
- Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.68)
- Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.46)
Synthesizing Agentic Data for Web Agents with Progressive Difficulty Enhancement Mechanisms
Pandit, Shrey, Nguyen, Xuan-Phi, Ming, Yifei, Xu, Austin, Wang, Jiayu, Xiong, Caiming, Joty, Shafiq
Web-based 'deep research' agents aim to solve complex question - answering tasks through long-horizon interactions with online tools. These tasks remain challenging, as the underlying language models are often not optimized for long-horizon reasoning and exploration. Prior work has proposed workflows for constructing instruction-tuning datasets, often leveraging knowledge graphs. However, such methods typically lack fine-grained control over difficulty and quality, yielding synthetic data that falls short of capturing the complexity required for long-horizon reasoning. Furthermore, many studies conflate data and training effects by comparing models trained under different optimization recipes, making it difficult to isolate and evaluate the effectiveness of the data itself. We introduce a two-pronged data synthesis pipeline that generates question - answer pairs by progressively increasing task complexity until a frontier baseline web agent fails. The baseline agent plays multiple roles in this process: attempting the questions, validating factuality, checking for alternative answers, and enforcing filtering. To evaluate the effectiveness of our synthesis methods, we adopt a controlled training setup based on distillation from strong web agents. Experiments across multiple web-based benchmarks show that our dataset - despite being smaller - enables the training of more effective web agents than existing datasets. In particular, our data exhibits twice the diversity in tool-use actions, allowing models trained on it to achieve stronger performance while avoiding repetitive tool-calling behaviors.
- North America > United States > California > Santa Clara County > Palo Alto (0.04)
- North America > United States > Wisconsin > Dane County > Madison (0.04)
- Europe > Spain > Catalonia > Barcelona Province > Barcelona (0.04)
- Asia > Middle East > Iran > East Azerbaijan Province > Tabriz (0.04)
- Research Report (0.50)
- Workflow (0.48)
- Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.96)
- Information Technology > Communications > Web (0.90)
- Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.67)
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.47)
OntoAligner Meets Knowledge Graph Embedding Aligners
Giglou, Hamed Babaei, D'Souza, Jennifer, Auer, Sören, Sanaei, Mahsa
Ontology Alignment (OA) is essential for enabling semantic interoperability across heterogeneous knowledge systems. While recent advances have focused on large language models (LLMs) for capturing contextual semantics, this work revisits the underexplored potential of Knowledge Graph Embedding (KGE) models, which offer scalable, structure-aware representations well-suited to ontology-based tasks. Despite their effectiveness in link prediction, KGE methods remain underutilized in OA, with most prior work focusing narrowly on a few models. To address this gap, we reformulate OA as a link prediction problem over merged ontologies represented as RDF-style triples and develop a modular framework, integrated into the OntoAligner library, that supports 17 diverse KGE models. The system learns embeddings from a combined ontology and aligns entities by computing cosine similarity between their representations. We evaluate our approach using standard metrics across seven benchmark datasets spanning five domains: Anatomy, Biodiversity, Circular Economy, Material Science and Engineering, and Biomedical Machine Learning. Two key findings emerge: first, KGE models like ConvE and TransF consistently produce high-precision alignments, outperforming traditional systems in structure-rich and multi-relational domains; second, while their recall is moderate, this conservatism makes KGEs well-suited for scenarios demanding high-confidence mappings. Unlike LLM-based methods that excel at contextual reasoning, KGEs directly preserve and exploit ontology structure, offering a complementary and computationally efficient strategy. These results highlight the promise of embedding-based OA and open pathways for further work on hybrid models and adaptive strategies.
- North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
- Asia > Middle East > Iran > East Azerbaijan Province > Tabriz (0.04)
- Europe > Germany > Lower Saxony > Hanover (0.04)
- (2 more...)
Benchmarking ChatGPT and DeepSeek in April 2025: A Novel Dual Perspective Sentiment Analysis Using Lexicon-Based and Deep Learning Approaches
Alhusseini, Maryam Mahdi, Feizi-Derakhshi, Mohammad-Reza
This study presents a novel dual-perspective approach to analyzing user reviews for ChatGPT and DeepSeek on the Google Play Store, integrating lexicon-based sentiment analysis (TextBlob) with deep learning classification models, including Convolutional Neural Networks (CNN) and Bidirectional Long Short Term Memory (Bi LSTM) Networks. Unlike prior research, which focuses on either lexicon-based strategies or predictive deep learning models in isolation, this study conducts an extensive investigation into user satisfaction with Large Language Model (LLM) based applications. A Dataset of 4,000 authentic user reviews was collected, which were carefully preprocessed and subjected to oversampling to achieve balanced classes. The balanced test set of 1,700 Reviews were used for model testing. Results from the experiments reveal that ChatGPT received significantly more positive sentiment than DeepSeek. Furthermore, deep learning based classification demonstrated superior performance over lexicon analysis, with CNN outperforming Bi-LSTM by achieving 96.41 percent accuracy and near perfect classification of negative reviews, alongside high F1-scores for neutral and positive sentiments. This research sets a new methodological standard for measuring sentiment in LLM-based applications and provides practical insights for developers and researchers seeking to improve user-centric AI system design.
- Asia > Middle East > Iraq > Baghdad Governorate > Baghdad (0.04)
- Asia > Middle East > Iran > East Azerbaijan Province > Tabriz (0.04)
- North America > United States > New York (0.04)
Advanced Deep Learning Techniques for Classifying Dental Conditions Using Panoramic X-Ray Images
Golkarieh, Alireza, Kiashemshaki, Kiana, Boroujeni, Sajjad Rezvani
--This study aimed to develop and evaluate multiple deep learning approaches for automated classification of dental conditions in panoramic radiographs, comparing the performance of custom convolutional neural networks (CNNs), hybrid CNN-machine learning models, and fine-tuned pre-trained architectures for detecting fillings, cavities, implants, and impacted teeth. A dataset of 1,512 panoramic dental X-ray images containing 11,137 annotations across four dental conditions was employed, with class imbalance addressed through random down-sampling to create a balanced dataset of 894 samples per condition. Multiple computational approaches were implemented and evaluated using 5-fold cross-validation, including a custom CNN architecture, hybrid models combining CNN feature extraction with traditional machine learning classifiers (Support V ector Machine, Decision Tree, and Random Forest), and three fine-tuned pre-trained architectures (VGG16, Xception, and ResNet50). Performance evaluation was conducted using standard classification metrics including accuracy, precision, recall, and F1-score. The hybrid CNN-Random Forest model achieved the highest performance with 85.4 2.3% accuracy, representing an 11 percentage point improvement over the custom CNN baseline (74.29%). Among pre-trained architectures, VGG16 demonstrated superior performance with 82.3 2.0% accuracy, followed by Xception (80.9 2.3%) and ResNet50 (79.5 2.7%). The CNN+Random Forest model exhibited exceptional performance for fillings detection (F1-score: 0.860 0.033) and maintained balanced classification across all dental conditions. Systematic misclassifica-tion patterns were observed between morphologically similar conditions, particularly cavity-implant and cavity-impacted tooth categories, highlighting the inherent challenges in distinguishing overlapping dental pathologies. Hybrid CNN-based approaches, particularly the combination of CNN feature extraction with Random Forest classification, provide enhanced discriminative capability for automated dental condition detection compared to standalone architectures.
- North America > United States > Ohio (0.04)
- Asia > Middle East > Iran > East Azerbaijan Province > Tabriz (0.04)
- North America > United States > Michigan > Oakland County > Rochester (0.04)
- Europe > Finland > Uusimaa > Helsinki (0.04)
- Research Report > Experimental Study (1.00)
- Research Report > New Finding (0.68)
Deep Learning-Based Forecasting of Boarding Patient Counts to Address ED Overcrowding
Vural, Orhun, Ozaydin, Bunyamin, Booth, James, Lindsey, Brittany F., Ahmed, Abdulaziz
This study presents a deep learning-based framework for predicting emergency department (ED) boarding counts six hours in advance using only operational and contextual data, without patient-level information. Data from ED tracking systems, inpatient census, weather, holidays, and local events were aggregated hourly and processed with comprehensive feature engineering. The mean ED boarding count was 28.7 (standard deviation = 11.2). Multiple deep learning models, including ResNetPlus, TSTPlus, and TSiTPlus, were trained and optimized using Optuna, with TSTPlus achieving the best results (mean absolute error = 4.30, mean squared error = 29.47, R2 = 0.79). The framework accurately forecasted boarding counts, including during extreme periods, and demonstrated that broader input features improve predictive accuracy. This approach supports proactive hospital management and offers a practical method for mitigating ED overcrowding.
- North America > United States > Alabama > Jefferson County > Birmingham (0.05)
- North America > United States > Florida > Pasco County > Holiday (0.04)
- Asia > Middle East > Iran > East Azerbaijan Province > Tabriz (0.04)
- Research Report > New Finding (1.00)
- Research Report > Experimental Study (0.68)
Flatness-based Finite-Horizon Multi-UAV Formation Trajectory Planning and Directionally Aware Collision Avoidance Tracking
Jond, Hossein B., Beaver, Logan, Jiroušek, Martin, Ahmadlou, Naiemeh, Bakırcıoğlu, Veli, Saska, Martin
Optimal collision-free formation control of the unmanned aerial vehicle (UAV) is a challenge. The state-of-the-art optimal control approaches often rely on numerical methods sensitive to initial guesses. This paper presents an innovative collision-free finite-time formation control scheme for multiple UAVs leveraging the differential flatness of the UAV dynamics, eliminating the need for numerical methods. We formulate a finite-time optimal control problem to plan a formation trajectory for feasible initial states. This optimal control problem in formation trajectory planning involves a collective performance index to meet the formation requirements to achieve relative positions and velocity consensus. It is solved by applying Pontryagin's principle. Subsequently, a collision-constrained regulating problem is addressed to ensure collision-free tracking of the planned formation trajectory. The tracking problem incorporates a directionally aware collision avoidance strategy that prioritizes avoiding UAVs in the forward path and relative approach. It assigns lower priority to those on the sides with an oblique relative approach, disregarding UAVs behind and not in the relative approach. The high-fidelity simulation results validate the effectiveness of the proposed control scheme.
- Europe > Czechia > Prague (0.04)
- Asia > Middle East > Republic of Türkiye > Aksaray Province > Aksaray (0.04)
- North America > United States > Virginia > Norfolk City County > Norfolk (0.04)
- (5 more...)
- Transportation (1.00)
- Information Technology > Robotics & Automation (0.34)
- Aerospace & Defense > Aircraft (0.34)
Scoring the Unscorables: Cyber Risk Assessment Beyond Internet Scans
Sarabi, Armin, Karir, Manish, Liu, Mingyan
In this paper we present a study on using novel data types to perform cyber risk quantification by estimating the likelihood of a data breach. We demonstrate that it is feasible to build a highly accurate cyber risk assessment model using public and readily available technology signatures obtained from crawling an organization's website. This approach overcomes the limitations of previous similar approaches that relied on large-scale IP address based scanning data, which suffers from incomplete/missing IP address mappings as well as the lack of such data for large numbers of small and medium-sized organizations (SMEs). In comparison to scan data, technology digital signature data is more readily available for millions of SMEs. Our study shows that there is a strong relationship between these technology signatures and an organization's cybersecurity posture. In cross-validating our model using different cyber incident datasets, we also highlight the key differences between ransomware attack victims and the larger population of cyber incident and data breach victims.
- North America > United States > Michigan > Washtenaw County > Ann Arbor (0.14)
- Europe > Netherlands > South Holland > Rijswijk (0.04)
- Asia > Middle East > Iran > East Azerbaijan Province > Tabriz (0.04)
How Managers Perceive AI-Assisted Conversational Training for Workplace Communication
Wilhelm, Lance T., Ding, Xiaohan, Knutsen, Kirk McInnis, Carik, Buse, Rho, Eugenia H.
Effective workplace communication is essential for managerial success, yet many managers lack access to tailored and sustained training. Although AI-assisted communication systems may offer scalable training solutions, little is known about how managers envision the role of AI in helping them improve their communication skills. To investigate this, we designed a conversational role-play system, CommCoach, as a functional probe to understand how managers anticipate using AI to practice their communication skills. Through semi-structured interviews, participants emphasized the value of adaptive, low-risk simulations for practicing difficult workplace conversations. They also highlighted opportunities, including human-AI teaming, transparent and context-aware feedback, and greater control over AI-generated personas. AI-assisted communication training should balance personalization, structured learning objectives, and adaptability to different user styles and contexts. However, achieving this requires carefully navigating tensions between adaptive and consistent AI feedback, realism and potential bias, and the open-ended nature of AI conversations versus structured workplace discourse.
- North America > United States > Virginia > Montgomery County > Blacksburg (0.76)
- North America > United States > California > San Francisco County > San Francisco (0.14)
- North America > Canada > Ontario > Toronto (0.14)
- (11 more...)
- Questionnaire & Opinion Survey (1.00)
- Personal > Interview (1.00)
- Instructional Material (1.00)
- Research Report > New Finding (0.93)
- Education > Educational Technology > Educational Software > Computer Based Training (0.46)
- Health & Medicine > Therapeutic Area > Psychiatry/Psychology (0.45)
- Education > Educational Setting (0.45)
Optimizing Urban Critical Green Space Development Using Machine Learning
Ganjirad, Mohammad, Delavar, Mahmoud Reza, Bagheri, Hossein, Azizi, Mohammad Mehdi
This paper presents a novel framework for prioritizing urban green space development in Tehran using diverse socio-economic, environmental, and sensitivity indices. The indices were derived from various sources including Google Earth Engine, air pollution measurements, municipal reports and the Weather Research & Forecasting (WRF) model. The WRF model was used to estimate the air temperature at a 1 km resolution due to insufficient meteorological stations, yielding RMSE and MAE values of 0.96°C and 0.92°C, respectively. After data preparation, several machine learning models were used for binary vegetation cover classification including XGBoost, LightGBM, Random Forest (RF) and Extra Trees. RF achieved the highest performance, exceeding 94% in Overall Accuracy, Recall, and F1-score. Then, the probability of areas lacking vegetation cover was assessed using socio-economic, environmental and sensitivity indices. This resulted in the RF generating an urban green space development prioritization map. Feature Importance Analysis revealed that the most significant indices were nightly land surface temperature (LST) and sensitive population. Finally, the framework performance was validated through microclimate simulation to assess the critical areas after and before the green space development by green roofs. The simulation demonstrated reducing air temperature by up to 0.67°C after utilizing the green roof technology in critical areas. As a result, this framework provides a valuable tool for urban planners to develop green spaces.
- Asia > Middle East > Iran > Tehran Province > Tehran (0.27)
- Oceania > Australia > Western Australia > Perth (0.14)
- Asia > India > Maharashtra > Mumbai (0.04)
- (19 more...)
- Research Report > New Finding (1.00)
- Workflow (0.92)
- Health & Medicine > Therapeutic Area (1.00)
- Health & Medicine > Public Health (1.00)
- Energy > Renewable (0.94)
- (6 more...)